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Students taking courses in statistic^ (theocy and/or methods) typically have iifficulty 
learning the details of the methods for describing and analyzing quantitative data. They 
have more difficulty understanding the theory behind the lethods- 

'Jnf or* ui.ately this condition leads to the use of rote memory and generates little 
enthusiasm for the"-" potential of statistical methods to. supply useful information and 
facili^iat'^ -decision making. In the pre-compu+er era, Me used a varietjy of visual ^ids in the 
classroom ' generate =?nthusiasB and intcrpc;^ among the students. Fot example, "2 X 2" or 
■^Smm proi*-i:tor slid=?s and Vu-Graph transparencies were created to show the student some of 
♦^he basic ideas in time series analysis, correlation and regression, and probability. In an 
attemDr to clarify the problems of. variati'^n, sampling errors, and inference, populations of 
various colorpi beads and wooden sampling pa^idlefi were used by instructors for classroom 
'Jemons*^ ra ti on. With some ingenuity a teachsir could use the bead population and samples for 
shedfling f" rhec light on.the cause of the basic problem in statistics — var iat ion ( vithi n and 
among pom tz' r-ns, within and among sampi ?s, fjnd between sample statistics and ropulation 
parameter.-^) . ' for =*xairple, the cigar boxes of beads and the paddle were used to generate real 
'-iin'^ sampling da*a from known p'-i pulati on s. The data were used fcr problems invo^'ving chi- 
squac?, analysis ofo va l" iance, and correlation and regression- Althojgh better *han using 
purely static data, these methods were obviously, inf lex ible (f.g-, rize and parameters of 
population, and size of sample were not &sily Ch*l^ged) . In addition,; it was no*- wise to 
oermit hundreis oAundergradua te students +o "p^ay" with the beads of controlled populations 
because of the ikBflihood of the populations netting cnt of control- Another major 
limita*-ion of pre^romputer teachina aids was that they provided no relief from the frequently 
Uffirult, mono* on' US, and time consuming tc^^k of data analysis- Fish bowls, cigar boxes of 
beads, and wooden paddles could help in generati,ng dynamic typ*- data, but the necessity for 
manual processina of data gr<>atly limited th ^ oppoVtunity to experiment wi*h and analyze 
diff?r?nt kinds ani sizes of populations:, and different sizes and types of samples, 

ahen ele":tronic computers became available on the campus th^^'" writer began to experiment 
with "rhe .problem of creating populations of various kinds, selecting random samples (with or 
without replacement), and analyzing and summarizing the results of thousands of samples- K 
basic *'tonl" n-^^^de^ is a random number geMpra^"or, (uniformly distributed from zero to one)[1] 
from which a v;»riety of differ- nt kind-3 of ranira numbers (continuous and discrete) can be 
generated. .Th' j.i -at potential of the elec + ronic computer for dynamic creation and analysis 
of da*a from populations and sampl^^s becamf* dppar>^nt. 

In 1960 the writer had *h*- opportunity to use a medium size electronic computer (the 
TJNIVA: SSeO at th^^ Uriversity of Alabama) fnr- cldrsroom instruction. At that time he was 
teaching both un'^ ergrad ua te courses in basin^ ss statistics and a course in statistical 
quality control, Tn the quality control course, the computer was used to simulate the 
classl::al fish bo»; iixperim^nts of Ualt3r Sheyhart[2]- With the aid of the computer and a 
randoflj number generator, all of Shewhart»s finite populations (normal, triangular, and 
rectangular) wer«» created and thousands of random samples cf various sizes were selected and 
analyzed to verif/ the validity of the quali+y cortrol charts (and the underlying central 
limit theorem). This was the real beginning of the writer's interes;t in having students use 
the electronic computer for conducting exppriments to verify concepts, to prove theorems, and 
to explore relationships not p:-eviously exr^^^ined by theorems. ..^ 

SPATS - statist ical ELograra to assist in teachi n^ §t^i.i§ti£§ 

In 1967 the writf^r had the opportunity to develop i master computer program to be used 
easily by students for generating and analyzing various )finds of populations and samples. 
This program is called SPATS (Statistical Program to Assist in Teaching Statistics) and has 
been used by both graduate and undergraduate^ stuaents. Briefly, SPATS consists of a series 
of PDRTRAN subroutines which porrait the user to create, analyze, organize, and present the 
data fDr a variety of different kinds of populations (continuous and discrete) and random 
sampl33 (with or without r^^ placement) , The numb'=*r and size of populations one can create and 
stor? is dependent upon the specific hardware used; originally SPATS was written in FORTRAJI 



'filmed 'from best available copy 



<f- ' - 

I? far the IBfi 360/ttO, and provided for the creation and storage of up to 10 different 

populations of 1000 iteits.or less, and up to 6 different randoi sasple^ of 500 iteis or 1<}68« 

The analyst can W printout of the individual population or saiple iteis J>f indicating 

that choice in th<) argument list. One of the icst poy^rful features of SPATS is that 

ndserDus randoi saiples (vith or without replaceient] can easily be selected, analyzed, 
suilarized, and presented for inspection. 



Td illustrate the siaplicity of using SPATS for creating various Kinds of populations 
nnd salectinq and analysing randon samples of various sizes from each of these populations, 
th« following short FORTRAN program is submitted. This program is sufficient to re-create 
(and Improve upon) [31 the clissical experiments of Shewhart in which he created two 
populations (normal, and rectangular) and selected 1000 random samples (wit)» replacement) of 
four Items each. The power of this sort of program for producing information is indicated by 
the extent of the analysis obtained from statements number 30 and UO. In addition, with a 
few modifications' the Shewhart experiment can be greatly expanded to include many more 
diffpcant Kinds and sizes of populations and samples. 

ST AT EHK N I^NOflgER F OEIE Afi.ST AT BB BfiT 

10 CALL HVPOPJ (3, 0., 1., 998, 1) 

20 CALL MBPOPJ (1, -3., 3., 998, 2) 

30 CALL SAMPLE <t, 998, 2, 1, 1000) 

UO CALL SAWPL2 (2, 998, U, 2, 1 , 1000) 

50 STOP 

60 END 

Motes; 

1, Statement number 10 c:reates and stores a normal population of 998 
items with a mean of 0. and a standard deviation of 1, 

2, Statement number 20 creates a rectangular population of 998 items 
ranging from *3. to ♦S. 

3, 5tatenen^ number 30 selects 1000 random samples of size ^, with 

replacement, from the normal population and analyzes the population ^ 
' and sample data by computing the following measurements: 

a. From population 

1) nean 

2) Variance 

3) Standard B^viation 

4) Standard error of sample mean 

b. Prom each sample 

1) Sample mean, variance, standard deviaticn, standard error 
of sample mean, confidence limits for population mean; 

2) Unbiased estimates of variance, standard deviation, and 
standard error of estimat*^^ 

3) List of each item in random sample (optional) 

c. From summary of all samples ^ 

1) Average sample mean, average variance, average standard 
deviation, average standard error of sample mea nc; 

2) Unbiased «^stimates of each of the above averages 

3) Measures of dispersion from the experimental distributions 
of sample sta ti sties such as: 

a) Variance and standard deviation of sample means 

bj Variance and standard deviation of sample variances 

c) Variance of standard errors of estimate 
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EXHIBIT I 

Laboratory Proble* No. 2 ^ 
BxperinGntal Study of Samples ^ Sample Statistics, and Sample Distributions" 

A. Introduction to problem 

rhe experiaent in laborato ry * problem no. 1 vas concerned with populations — the problem 
of designing, creating^ analyzing, and studying different kinds of populations. 

Tn this laboratory problem the focal point is on samples, and the experiaent is designed 
1^0 permit the student to design and create random samples, and to study experimentally the 
StiEiiiS^isliSl of iaiEl§ §t^tisii£s and sample distributions in the light of known 
characteristics.' * 

rfiis kind of exploration (roads feasible by SPATS and the electronic computer) should 
provide a better understanding and appreciation of sampling theory, and the basic concepts of 
randoD behavior. 

B. Perform the following operations: 

1. Design, create, and store two different kinds of populations. You may use the 
parameters in laboratory problem no. 1, if you desire, but increase the size of 
each of *he populations to 1000. 

2. (Optional) . Print the. data from the two populations. 

3. a) CoM>Jte the first four moments for each population. Construct a table 

spftmarizing for each population the following measurements; mean, variance, 
) jcoefficient of skewness, and coefficient of kurtosis. 

b) ^ Construct a frequency distribution for each popu lat ion. Use arguments for 
PPED as suggested in (5) of lab problem No. 1. 

Take four . randon samples of the indicated size of each of the two populations. 
(Use printing code, ^p = 2, in SAMPLE) 

5 with replacement 

5 without replacement 

50 with replacement 

50 without r ep lac erne n t 

5*. Take 100 random samples of sizs 50 each from your populations. Print only the 
individual statistics and the final averages (IP = 1) 

6. Take '000 random samples of siz*> 5 each from your normal population. Print only 
the final averages and mevisurv-i^ of dispersion (IP = 0). Repeat using sample size = 
50. 

HDte: As a by-product of this use of the subprogram SAMPLE, the following sample 
statistics have been computed and stored. 

Sample Statistics Column in XX Matrix 



m ea n • 7 

variance 8 

standard deviation 9 
standard error of 

estimate 10 



Phus, you will create at this point in the experiment — listrifeutiaQ? sample 
statistics or "sampling distributions" fU, p.1i*2]. These dist ributions"will"be in the 
form of arrays which can then be summarized^ and analyzed as desired. 



1, It is now desired to Bake an additional analysis of selected statistics. 

a) Compute the first four moiients for each of the f cur saa2li,na distributions 
listed in the note in item 6. " 

b) Construct a frequency distribution fro« each array cf §§521 in^ distributions. 
Us*? class liaits of your choice. 

Repeat steps 6 and 7 using your second EQEulatign (the non-noraal one) in place of 
t h0 nor»al population. Use samples of size 5 itets or 50 ite'*s« 

Wdw, based on your experiments in Part B, answer the following questions: 

(Chapter Q, Spiegel, is a good reference) 

1) CoBiaent on your observations froa the suamary table in B 3(a). (Are the 
peasurements in line with the characteristics you expected based on your work in 
lab problen no, 1?) 

2) Coaiient on your observations froa the frequency distributions constructed in B 
3(b). (Is the distribution in each case about what you expected?) 

3) What effect does sampling ifith or without replacenent have on the conputation of 
sauple statistics? Give an example from your data. ^ 

U) a) Formulate a definition of "68% and 95% confidence limits*', Tesr this 
definition with your experimental data from E (5> , 

b) Formulate a definition of biased and UTi^biased estimates. Give two examples of 
each kind of estimate from your experimental data obtained in B (S) , 

c) Examine the results of B (6) and comment on the observed averages of biased 
and unbiased statistics. How do they compare with the corresponding 
popu lat ion parameters?' 

5) Formulate a definition of the central limit theorem (see Spiegel, p. 1U2 and/or 
other references). Test the validity of this theorem with the results of your 
computations in B, parts (6), (7), and (8). (Hint: F.xamine ^h^ distributions of 
sample means from each cf the two populations for both small and large samples; 
examine the theoretical and experimental standard errors of sample means) - 

Does your experimental data support the theorem? Defend your answer. 

5) Pecord the following statistics from your calculations in B (6) and test these 
experimental results against the theoretical values [ a, p.14U]. 

a) Standard deviation of sample means 

b) Standard deviation of sample variances 

c) Standard deviation of sample standard deviations, 
rommer.t on your observations. 

Note: Unless specifically irdicated, samples may be sel«=cted vith or Hithout 
replacement. 
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EXHIBIT II 

o 

POFTPAK IV Hain Prograi for Lab Problei Mo. Osx|iq SPATS 
STXTGHEtlT REHARKS 



ZhLL MVP0PJ(2, 70/, 10,, tOOO, 2) 
CRLL *1VP0PJ(1, UO., 100,, 1000, n 
PRINT {2, U00,2) 
DO 3 J=1, 2 

TALL MOnPS 1000, 100,, 2, JO 
BOT = XBAR - 3.*SDEV 
TOP = XBAR ♦ 3.*SDEV, 
CI = SDEV/3- 
3 CKLl FRED (BOT, TOP, CI, lOOC, J) 

I* CALL SAMPLE <2, 1000, 5, 2, 0, 1) 
CALL SAHPLE (2, 1000, 5, 2, 1, 1,) 



CALL SAMPLE (2, 1000, 50, 2, 0, 1) 

:ALL SAMPLE (2, 1000, 50, 2, 1, 1) 

5 CALL SAMPLE (2, 1000, 50, 1, 1, 100) 

CALL SAMPLE (1, 1000, 50, 1, 1, 100) 



00 8 MS = 5, 50, as 
00 8 NP = 1, 2 



Creates norial pop* (T'slO 

Creates unifori pop.;A=ao, 6=100 

Prints the 2 populations. 

Computes the first four aoacrts for 
each of the populations and constructs 
a frequency distribution for each 
pop, usLn^ ji and o~ fos constructing 
classes 



Select randan sample of size 5 
froB nocnal pop*, vithout and vith 
replacement, 

same as above except saaole size = 
50, 

Select and analyze 100 random 
samples from normal pop. and from 
un if oris population • 



CALL SAMPLE#(NP, 1000, NS, 0, 1, 500) 
DO 7 J = 7, 10 

CALL MOMPS{a, 500, 100., 2, 0) 

BOT = XBAP - 3,*SDBV 
TOP = XBAR ♦ 3.*SDEV 
CI = SDEV/3. 

CALL FRED {BOT, TOP, CI, 2, 500, J) 



Select and analyze 500 random 

samples of size NS from pop* (Kind = HP)* 

Compute 1st U moments from Jth pop* 

Compute arguments for FRED using 
mean (XBAR) and c (SPB^) from last 
flOMPS. ^; 

Construct freq* distribution for Jth pop,, 
J=7, B, 9, 10; Note: Pop. her^ are actually 
+:he nevly created sa^^ling distributions created 
f r om statement no. 6 ; spe 1 ab prob . no . 2 . 



CONTINUE 

STOP 

END 



Note: For a i)e-run using a different random sfart, add the following statement at the 
beginning of this program: 



MSTABT = 613217 [Or use any other odd number with 8 or less digits] 



Using SPATS in the Classroom 

To illustrate how SPATS may be used in the classroom, an example of a laboratory problem 
is presented in Exhibit I. The SPATS program for performing the required operations appears 
in Biihibit II* The problem included is the second in a set of four which were used in a 
three semester hour course offered at Mississippi State University entitled. Business 
Statistics Using the Computer (required of all undergra dutae majors in business statistics 
and data processing). The prerequisite is six semester hours of business statistics and 
three semester hours of data processing (including FORTRAN). 

The idea of laboratory experimentation with populations ani samples being somewhat 
foreign, the students were slow to catch on to the idea when first explained. However, when 



thej began to see soie tangible exaagl^s of quantlti^^s which previously were FOiewhat 
nebulous, such as di^wrrete and continuous populations, r%j)J^i?» Miples, ponfidenre liiits, 
biased and unbiased estiiates^ and swiplinq distributions* their entbusiasa incr»a5K?d. Soae 
students stated after laboratory ptoblei no. 2 that for the first tiie th^y understood so«e 
concepts and relationships that they had previously depived in iatheiatical statistics 
courses. For example, "95% confid*ence liBits,'^ and biased and unbiased estimates become more 
meaningful when supported by enpirical evidence. 



£l2XiQ9 IbtSrems ^ith SPATS 

Dne further extension of SPATS which has great potential is that of proving theor^ns 
experimentally. This has not been attempt«?d with undergraduates ,but I have tried this 
approach with a small 'group of graduate students in business. The Intent wasto have th* 
studentr. make a parallel, proof of each th-^oren experimentally and analytically, and to 
evaluate the merits of each approach. The analytical prpofs had to be omitted due to lack of 
time md the students' insufficient , mathematical backgrounds. Since the experimental 
appro^wh was new to them, as was the case with the undergraduates in the other course, it was 
more of a challenge than I realized. Instead of having thorn perform all of the experiments 
on the assignment sheet (Exhibit III), it was necessary to settle for no more than six 
lessons. One drawback is that SPATS was not created originally to prove theorems directly* 
Ther^^fDre, it is necessary to use only limit «d subsets of SPATS and a great deal of POPTRAN 
to carry out the experiments. One of the graduate students, however, is now working on a 
5iss?rtation which is concerned with the problem of developing easy to use computing 
"appiratui" ani "materials" (analogous to that usfi in chemistry experiments). It is hoped 
*hat these new materials will Qreatly simplify the problem of designing and carrying out a 
large number of experiments fcr testing statistical theorems, and even exploring 
relationships not supported by ma-chematical proof. 



EXHIRIT III 



A Dozen Additional Statistical Experiments Feasible with SPATS 



1. Prov<^ 'Experimentally that ^r^^ " 

2. r^st experimentally the theory of.^.tivQ mat hemat ici ar D'Aleabert who reasoned that the 
three possible outcomes of a toss of two coins are egually likely. j 

3. Design an 'experiment similar to Halter Shewhart's classical experiment in quality 
control concerning the distribution of sample means where the samples are of various 
sizes ♦'aken from both non-normal and normal populations. 

U. O^sigr an experiment to test xh^ frequency concept of probability. 

5. Perform an experiment to show the influence of exper itnental data on subjectively 
assigned prior probabilities concerning various hypotheses (or events) which are 
Tiutually exclusive and completely exhaustive. This woul^l amount to an ex per imeii t a 1 test 
->! Bayes theorem (see Hosteller, Rourke, and Thomas[5j, p. 1U6). 

6. Prove ^^xper inen* a 11 y that the variance of the sun of n independent random variable is 
♦"he •^um of tb«ir variances. 

L~t C =^ a constant, and X = a random variabl*^. Prove experimentally the following: 

^^(C-X) = • ^MX) 



r(c*x) = (T(X) 

^5(a + bX) = • ^ ^ (X) 

rhebyshev's theorem states that the p-obability is at least (1 -(l/h^)) that a random 
vari'ible from a population falls within h standard deviations from the population mean. 
r=*st this theorem experimentally using several different kinds of random varia^es. 

9. Zh^byshev's theorem applies to the probability of items in a saaple as w^^ll as to the 
random variates from a population. Test experimentally that at least (1 • (1/h^)) of 



the it€«5 in o random sa»ple are within h ^sai£le standard deviations iron th ^ sagpl e 
■ean, \ 



10. Prove experimentally the DeKcivr e-taPlace theorem, let , (i = t, n> be the nuBber of 

successes in a binomial experiment With n trials each with a probability of success = p* 

<• 

Prove that ^ i^n = ((^^ - n p) / frTpq) ^ ^) approaches the area to. the right of z for a 
standard noxaal distribution. V. 

11. Experimentally construct a chi-square distribution vith ■eat = n and variance = 2n. 

1?. Experimentally check the validity of F. N. David's Tables of Correlation coefficients 
(Cambridge Press, 193B) using population correlation ccefficients (rho) = , .1, *2, 
.9; an?l_using sample size n =^ 3 to 25 (increments of 1), and large samples v^ere n = 50, 
10 0, 200, and UOO. 



SP^^?S is a master statistical program developed originally for the purpose of 
simplifying the problem of experimentally creating and analyzing a variety of populations 
(discrete ar.d continuous), and simplifying the f-asX of selecting and analyzing a variety of 
different kfr^s of randoa samples, together with individual data and summary statistics. It 
has been used in the classroom and does give students a better understanding of some of the 
basic concepts and relationships pertaining to populations and samples. 

In the process of developing and using SPATS, the idea^ occurred that some of the basic 
components of SPATS could be used by the students for carrying out statistical experiments to 
prove theorems similar to those experiments used in chemistry to prove Boyle's and Charles' 
law:?. Although experience with this application of computers has not been as extensive as 
oth*-r «*'ore traditional applications, results to date give support to the belief that computer 
assist' d statistical experiment s, will' be a major breakthrough in the process of providing 
studititi with a real understanding of basic statistical principles and theorems. Research is 
-ontv/nuir.g on the problem of sharpening and testing the tools of SPATS so as to make them 
more feasible for unde ror aduat e classroom use. 



1. ivrigjf^ally the mid-square method was used but more recently some f or ■ of the 
liult iplicative congruent ia 1 method has been found to be more satisfactory* 

2. tf. A, Shewhart, Economic Control of Quality of Manufactured Product. Princeton, N. J.: 
0. Van Nostrand CoTT 1931. 

3. Although both Shewhart's and SPATS normal and rectangular populations do not exactly 
iuplicate the theoretical distributions, the SPATS populations are more continuous (less 
discrete); also the r*^ctangular population in SPATS contains 998 items whereas 
Shewhart's contains only 122. f 

^. :mrray R. Spiegel, Theory and Problems of Statistics ■ New York: Schaum Publishing 
Company, 1961. 

5. "hosteller, Roarke, and Thomas, Probability and Statistics Addison-Wesley Publishing Co., 
1961. 



Summary and Conclusions 
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NOTES AND REFEPEHCES 
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